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Abstract 

We introduce an agent-based model of interaction, drawing on the con- 
tingency approach from Luhmann's theory of social systems. The agent 
interactions are defined by the exchange of distinct messages. Message se- 
lection is based on the history of the interaction and developed within the 
confines of the problem of double contingency. We examine interaction 
strategies in the light of the message-exchange description using analytical 
and computational methods. 

1 Introduction. 

Contingency is a key component of the theory of social systems elaborated by 
the German sociologist Niklas Luhmann. The theory describes a vision of so- 
ciety as a self-reproducing (autopoietic) system of communicative interaction. 
It is based on the work of Talcott Parsons (sociology), Edmund Husserl (phi- 
losophy), Heinz von Foerster (cybernetics), Humberto R. Maturana/Francisco 
Varela (evolutionary biology), and George Spencer Brown (mathematics). The 
first comprehensive elaboration of the theo ry ca. n be dated to the appearance 



of Soziale System e ( "Social Systems" ) in 119841 with an English translation 
available in 1 199,4 and found a kind of finalization with the appearance of Die 
Gesellschaft der Gesellschaft ("The Society's Society") in 1997. The most rel- 
evant texts for the modeling of learning as a social process can be found in 
Social Systems chapter 4, "Communication and Action," (p. 137-175) and in 
chapter 8, "Structure and Time" (p. 278-356). For readers who are interested in 
an exposition of Luhmann's id eas we recommend for a start Einjiihrung in die 



Systemtheorie (|Luhmannll2004l ) and the February, l200ll issue of Theory, Culture 
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and Socie ty, parti cularly the contributions "Nikl as Luhma nn: An introduction" 
by Jakob Arnoldil and "Why systems?" by Dirk Baeckei . 

The present paper is the first step of a common interdisciplinary work in- 
volving a sociologist and theoretical physicists. Our ultimate goal is to design a 
multi-agent model using the theoretical background of Luhmann's theory and 
based on a mathematical formalization of a social process in which expectation- 
driven communication results in learning. We especially want to explore knowl- 
edge diffusion and interactive knowledge generation (innovation) in various so- 
cial structures, represented essentially as dynamically evolving graphs. Primar- 
ily, we adopt a general point of view of modeling from theoretical physics and 
only secondarily intend to have an "as realistic as possible" model. Rather than 
putting "everything" in, the focus is to distinguish a few features in Luhmann's 
theory that are relevant for our purposes, and propose a model of expectation- 
driven interaction that is a possibly rough first step, but is tractable either on 
an analytical or computational level. In particular, it is promising to use an 
approach based on dynamical systems theory and statistical mechanics to an- 
alyze in£de]s_of_sociBJJiito some research activity in this 
field (iFortunato and Stauffeil l2005t IStauffeij[200i IStauffer et aLll2004l: IWeisbuch 
I2OO4I : Weisbuch et al.l 2005f) . On the one hand, dynamical systems theory pro- 
vides mathematical tools to study the dynamical evolution of interacting agents 
at the "microscopic" level (specifically, the detailed evolution of each agent is 
considered). On the other, statistical physics allows in principle a description at 
the mesoscopic level (groups of agents) and macroscopic level (the population 
as a whole). 

However, though statistical physics gives accurate description of models in 
physics, one must attend to the fact that interactions between human beings 
are more complex than interactions usually considered in physics: they are non- 
symmetric, nonlinear, involve memory effects, are not explicitly determined by 
an energy function, etc. The present paper is an attempt to consider social 
mechanisms beyond state-of-the-art modeling. 

We introduce a mathematical model portraying interaction as an alternating 
exchange of messages by a pair of agents where memories of the past exchanges 
are used to build evolving reciprocal expectations. The resulting "interaction 
sequence" is quite rich and complex. In fact, the pairwise interactions between 
agents depend on various independent parameters, and tuning them can lead to 
drastic changes in the model properties. In this work, we give some results in 
this direction, based on mathematical as well as computational investigations. 
These results are a necessary foundation for handling the intended multi-agent 
case. 

The present model is a gener alizat ion of the work of Peter Dittrich, Thomas 
Krohn and Wolfgang Banzhaf (2003). Based on Luhmann's theory of social 
systems, they designed a model which describes the emergence of social order. 
We follow them in portraying communicative interaction as an alternating ex- 
change of messages by a pair of agents. We use the memory structures they 
defined as a starting point for our agents. The memories are the key to the 
formalization of the structure of expectation in their model and we take the 
same approach. However, we introduce an additional form of memory to allow 
individual variation in the expectations of the agents. 

The main body of the paper is divided into four sections. First the conceptual 
background is elaborated, followed by the mathematical definition of the model. 
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Next, the main mathematical and simulation results are discussed. The text 
ends with an outlook on planned extensions of the model. 



2 Conceptual background. 

The concepts from information theory and Luhmann's social theory which we 
use to model expectation-driven interaction are: 

1. Information and meaning structure, 

2. Interaction, and 

3. Double contingency and expectation-expectation. 

These concepts will be integrated in a model which explores interaction strate- 
gies using different types of interconnected memories. The interconnectedness 
of the memories is a precondition for obtaining agents "capable of acting." 



2.1 Information and meaning structure. 

The concept of information used in this paper ori ginate s with the mathematical 
theory of communication of Shannon and Weaver ( 1949h . now known as informa- 



tion theory. First, information is defined as bei ng produced when one message 



is chosen from a set of messages (|Shannonlll948[ ). The minimum requirement of 



information production is therefore a set of two messages and a "choice" (usu- 
ally, the number of messages is higher, but finite). Second, information sources 
are characterized by entropy: 

That information be measured by entropy is, after all, natural when 
we remember that information, in communication theory, is associ- 
ated with the amount of freedom of choice we have in constructing 
messages. Thus for a communication source one can say, just as he 
would also say it of a thermodynamic ensemble, 'This situation is 
highly organized, it is not characterized by a large degree of random- 
ness or of choice — that is to say, the informati on (or the ent ropy) is 
low. (|Weaveij|l949l. pg. 13) 



This means that the greater the freedom of choice and consequently the 
greater the amount of information, the greater is the uncertainty that a partic- 
ular message will be selected. Thus, greater freedom of choice, greater uncer- 
tainty and greater information go hand in hand. It is important to note that this 
uncertainty-based concept of information must not be confused with meaning. 
Although messages usually do have meaning, this is irrelevant to the mathemat- 
ical formulation of information on the basis of uncertainty and entropy. Two 
messages, one heavily loaded with meaning and the other of pure nonsense, can 
have the same amount of information. 

The theory of social systems refers to Shannon and Weaver and uses as a 
starting point t he same definiti on: information is a selection from a repertoire 
of possibilities ( Luhmann IQQSl pg. 140). But contrary to information theory, 



meaning is addressed prominently. It is conceptualized as the durable comple- 
ment of the transient information. Meaning is a complex referential structure 
which is used by psychic systems (i.e., information sources) to organize the 
process of information selection. 
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By information we mean an event that selects system states. This 
is possible only for structures that delimit and presort possibilities. 
Information presupposes structure, yet is not itself a structure, but 
rather an event that actualizes the use of structures. . . . Time it- 
self, in other words, demands that meaning and information must 
be distinguished, although all meaning reproduction occurs via in- 
formation (and to this extent can be called information processing) , 
and all information has meaning. ... a history of meaning has al- 
ready consolidat ed structures that we treat as self-evident today. 
(|Luhmannlll"995l pg. 67) 



According to social systems theory, the historically evolved general meaning 
structure is represented on the micro le vel (i.e., psychic systems, agent level) 
in the form of the personal life-world (|Luhmannl [iQqI pg. 70). A personal 



meaning-world represents a structurally pre-selected repertoire of possible ref- 
erences. Although the repertoire of possibilities is limited, selection is necessary 
to produce information. Meaning structure provides a mental map for selection 
but does not replace it. All together, the one is not possible without the other — 
information production presupposes meaning structure and the actualization of 
meaning is done by information production. 

The present model considers both information and personal meaning struc- 
ture in a simplified manner. Information is treated in the above described math- 
ematical form and meaning structures in the form of a second order approach 
(pairwise combination). This will be developed in section [31 below. 

2.2 Interaction and interaction sequence. 

Communicative interaction depends on the availability of information producing 
agents. Individuals or other agents are therefore often the elemental unit of soci- 
ological communication analysis. Contrary to that, communication units are the 
central element of analysis in social system theory and agents are conceptualized 
according to their contribution to communicatio n units. Each communication 
unit consists of three selections ( Luhmann 1995f) : 



1. Information is produced by selection from a structured repertoire of pos- 
sibilities by an agent (alter). 

2. The agent (alter) must choose a behavior that expresses the produced 
information (act of utterance). 

3. Another agent (ego) must understand what it means to accept alter's 
selection. 

Information production includes therefore that alter not only views itself 
as part of a general meaning world but assumes also that the same is true for 
ego. In other words, alter and ego act on the assumption that each life-world 
is part of the same general meaning world in which information can be true 
or false, can be uttered, and can be understood. This assumption of shared 
meaning structures allows alter to generate an expectation of success, i.e., the 
expectation that the selection will be accepted by ego. 

By 'success' I mean that the recipient of the communication ac- 
cepts the selective content of the communication (the information) 
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as a premise of his own behavior, thus joining further selections to 
the primary sele ction and reinforcing its selectivity in the process. 
(|Luhmannlll"990l pg. 88) 



An interaction system is constituted by one communication unit following 
another. The c losure of an interaction system is given by two constituents 
( Luhmaiml Il990l pg. 5ff): participants must be present and communication 



units can only be understood in the context of the system. Consequently, at 
least a minimum extent of understanding is necessary for the generation of an 
interaction system. If one communication unit directly follows another, it is a 
positive test that the preceding communication unit was understood sufficiently. 
Every communication unit is therefore recursively secured in possibilities of 
understanding and the control of understanding in the connective context. 

In the present model, the term "message" is used to describe the three parts 
of the communication unit. From this follows that the model is at its core a 
message exchange model (ME model) and starts with a list of messages. The 
three-part unity is now: 

1. Alter selects a message. 

2. Once the selection process is finalized, the message is sent. 

3. Ego receives the sent message. 

Within the present version of the ME model, agents do not have the option 
to reject a message or to refuse to answer. Their "freedom" is incorporated in 
the process of message selection. Agents can be distinguished by the number 
of messages they are able to use and by their selection strategies. The ongo- 
ing exchange of messages creates an interaction sequence with a limited — but 
possibly high — number of steps. 

Overall, the term "interaction" is considerably simplified in comparison to 
the full Luhmannian notion. Additionally, the notion of interaction system is 
not used in the present version of the model, because "understanding" is not 
considered. 



2.3 Double contingency and expectation-expectation. 

In social system theory, the dynamics of an interaction sequence are explained 
by contingency and expectatior{3. First, contingency arises because agents are 
complex systems that are "black boxes" for each other. An agent will never 
know exactly what the other will do next. The reciprocity of social situations 
(two agents: alter and ego) results in double contingency: alter contingency and 
ego contingency. 

Second, the black-box problem is solved by expectations. Agents create 
expectations about the future actions of the other agents to adjust their own 
actions. For example, two business partners meeting antemeridian expect from 
each other that the adequate greeting is good morning (and not good evening, 
etc.). This is a situation with high expectational security. If the two had 
a confiict last time they met, the expectational security may decrease (will 

^Expectations in social systems theory are different from expectations in probability or 
statistics; the meaning is closer to the statistical notion of conditional expectation. In this 
work, we use the term in the social systems sense. 
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the other answer my greeting?). The e xample illustrate s that expectations are 
condensed forms of meaning structures (jLuhmann I1995I pp. 96, 402). Based on 



the societally given and biographically determined structures of expectations, 
alter and ego have situation-specific reciprocal expectations. 

In social systems, expectations are the temporal form in which struc- 
tures develop. But as structures of social systems expectations ac- 
quire social relevance and thus suitability only if, on their part, they 
can be anticipated. Only in this way can situations with double 
contingency be ordered. Expectations must become reflexive: it 
must be able to relate to itself, not only in the sense of a diffuse 
accompanying consciousness but so that it knows it is anticipated as 
anticipating. This is how expectation can order a social field that 
includes more than one participant. Ego must be able to anticipate 
what alter anticipates of him to make his own antic ipations and be- 



havior agree with alter's anticipation. (|LuhmanrJll995 . 
pg. 303) 

The duplication of contingency causes the duplication of expectation: 

• Alter has expectations vis-a-vis ego. 

• Ego knows that alter expects something from him/her, but can never know 
what exactly the expectation is. 

• Therefore, ego builds expectations about "alter's expectations of ego," 
that is, expectation-expectation. 

In complex societies, expectational security is difficult. Therefore, it is nec- 
essary to have stabilizing factors. One way to establish expectations that are 
relatively stabilized over time is through relation to something which is not itself 
an event but has duration. Examples of well-known expectational nexuses in 
sociology are persons and roles. Persons are societally constituted for the sake of 
ordering of behavioral expectations. Roles serve — compared with persons — as 
a more abstract perspective for the identification of expectations. 

Within the ME model in its present form, the agents are the relevant expec- 
tational nexuses. 

2.4 Memories and memory development. 

The agents are equipped with three different types of memories: initially, their 
behavior tends towards a general long-term memory which represents a given 
scope of interaction. That is, they are able to use a certain number of mes- 
sages in interaction sequences. The ability to use messages is determined by 
the above mentioned pairwise combination. The model agents start with a ran- 
domly or otherwise specified stimulus-response matrix (response disposition, see 
section I3.3.ip . 

Since memory development depends on expectation driven interaction, agents 
are equipped with specific memories allowing them to construct expectations. 
Thi s is done by the int roduction of specific ego- and alter-memories (follow- 
ing Dittrich et al. The ego memory represents an agent's expectation 



towards itself and the alter memory an agent's expectation-expectation (i.e.. 
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expectations of agent B about the expectations of agent A vis-a-vis agent B). 
The model starts with pristine ego- and alter- memories which will be updated 
with every interaction. The links between the long-term memory and ego- and 
alter-memory will be explained in section [31 

2.5 Interaction strategies. 

Within the presented model, interaction strategies are selection strategies. All 
three memories are used in each information- production activity (i.e., selection 
from the pairwise structured repertoire of possibilities in the form of stimulus- 
response matrices). The way the memories are used defines the interaction 
strategy. Agents can be specified to follow various strategies, e.g., "conser- 
vative" agents maximize towards predictability, "innovative" agents towards 
unpredictability. 

Interaction strategies are formalized in section 13.3.21 The strategies re- 
sult from balancing out conflicting behavioral intentions for agent interactions. 
Specifically: (1) ego-oriented behavior, maintaining individual behavior patterns 
disregarding the interaction partner; (2) interaction-oriented behavior, providing 
stable responses for the interaction partner through consistent and comprehensi- 
ble behavior; (3) uncertainty-oriented behavior, seeking or avoiding unexpected 
responses from the interaction partner; and (4) risk-oriented behavior, handling 
novel or erroneous responses. We investigate the effect of various strategies 
using analytical (section [4. ip and computational (section [4. 2p methods. 

In this work, we focus on how strategies affect interaction; we will not present 
rules for choosing strategies. However, it is clear that the choice of a strategy is a 
way for agents to cope with the problem of appropriate behavior in a situational 
context. A desirable — and challenging — extension of the presented model would 
be to develop rules for choosing and evaluating strategies. 



3 Model definition. 

3.1 Messages. 

In the model, the fundamental elements of agent interactions are messages. 
The number of messages is assumed to be finite. We do not concern ourselves 
here with giving a real-world interpretation to the messages, and instead simply 
number them from 1 to the number of messages ng. 

Messages are transmitted through a multi-step process. A message is first se- 
lected, then sent and simultaneously received by the interaction partner. We do 



not co nsider the medium in which the message is sent. Following iDittrich et al 



([20031), we assume that messages are distinct from one another. This assump- 
tion provides a significant simplification over, e.g., natural languages, ignoring 
issues such as ambiguity or transmission errors. 

It can be convenient to recognize that messages transmitte d by an agent 



are fo rmally equivalent to states of the agent. A construction bv IDittrich et al 



(|2003h illustrates this in a readily accessible fashion: each agent has a number of 



signs available, and communicates a message by holding up the appropriate sign. 
Thus, selecting a new message to transmit is equivalent to a state transition — 
the agent holding up a new sign. 
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3.2 Interaction sequences. 



Agents take turns sending messages to each other. In doing so, they record 
the foregoing sequence of messages, and must select the next message in the 
sequence. 

We anticipate that the message selected will depend most strongly on the 
most recent messages in the sequence, while the exact ordering of earlier mes- 
sages is of lesser or negligible importance. This suggests that a reasonable 
approximation of the selection process is to consider only a finite number of pre- 
ceding messages, rather than the complete sequence. Thus, the agents should 
select the message based on the statistics of subsequences of various lengths. 
Earlier portions of the message sequence are used to define the agents and the 
rules by which they select messages (see section [?75)) . 

The length of the subsequences in such approximations define the order 
of the approximation. A zeroth-order approximation considers only message 
identity, ignoring how the messages actually appear in sequences. A first-order 
approximation considers message sequences of length one, showing the relative 
frequency with which the messages appear in sequences. Second- and higher- 
order approximations consider runs of messages, showing correlations between 
messages in the sequences. 

In this paper, we focus exclusively on a second-order approximation, dealing 
with pairs of messages. A second-order approximation is the simplest case in 
which relationships between messages become important, and can be usefully 
interpreted as dealing with stimuli and responses. However, it is vitally impor- 
tant to recognize that each message is simultaneously used as both stimulus and 
response. 

In this work, we adopt the point of view that all agent interactions are ex- 
pressed and interpreted in terms of message sequences. Consequently, we assess 
interactions in terms of message sequences as well. An example of how to do 
this is to identify a specific agent's behavior as authoritative, effectively defin- 
ing a utility function; the student-teacher situation described in section 14.2.41 
is analyzed in this way. Interaction of a single agent with its environment can 
be treated in the same fashion by describing the environment itself as a second 
agent and the interaction as messages. 



3.3 Agents. 

In this section, we formally define the agents. Essentially, an agent maintains 
a number of memories that describe statistics of message sequences in which it 
has taken part, and has a rule for using those memories to select a response 
message to a stimulus messa ge. The agents are a generalization of the agents 
used bv lOittrich et al. ( 2003 ): see section [3.3.21 



Although we focus in this work on interactions between pairs of agents, the 
agent definitions arc posed in a manner that can be generalized to communities 
of agents. See section [5?^ for discussion of how the model could be extended to 
handle groups of agents. 



3.3.1 Memories. 

Agent memories fall broadly into two classes: (1) memories of the details of 
particular interaction sequences between one agent (alter) and another agent 
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(ego) and (2) long-term memories reflecting the general disposition of a single 
agent. All of the memories are represented as probability distributions over 
stimuli and responses. 

We use three different forms of probabilities in the definition and application 
of the agent memories. For example, consider the ego memory (defined fully 
below). We denote by s'^g ir^s) the joint probability that agent A maintains 
at time t, estimating the likelihood that agent B will transmit a message s to 
which agent A will respond with a message r. Second, we use the conditional 
probability s'^^^g (r | s), showing the likelihood of agent A responding with mes- 
sage r given that agent B has already transmitted message s. Finally, we use the 
marginal probability £')l^ (r) , showing the likelihood of agent A responding with 
message r regardless of what message agent B sends. These three probabilities 
are related by 

s 

f (r\s) - ^' (2) 

We use similar notation for all the memories (and other probabilities) in this 
work. 

The memories o f messages exchanged are patterned after those used by 



Dittrich et al.l (j2003f ). They take two complementary forms, the ego memory 
and the alter memory, reflecting the roles of the agents as both sender and 
receiver. 

The ego memory S^^g (r, s) is a time-dependent memory that agent A main- 
tains about interactions with agent B, where agent B provides the stimulus s to 
which agent A gives response r. During the course of interaction with agent B, 
the corresponding ego memory is continuously updated based on the notion that 
the ego memory derives from the relative frequency of stimulus-response pairs. 
In general, an agent will have a distinct ego memory for each of the other agents. 

Before agents A and B have interacted, agent A's ego memory s'^^ (r, s) (as 
well as agent i?'s counterpart (r, s)) is undefined. As agents A and B inter- 
act through time t, they together produce a message sequence mi, TO2, ■ • ■ , iTit-i, iTit- 
Assuming that agent B has sent the first message, one view of the sequence is 
that agent B has provided a sequence of stimuli mi, ma, . . . , mts, mt-i to which 
agent A has provided a corresponding set of responses m2, m4, . . . , mt-2, irit- If 
agent A began the communication, the first message could be dropped from the 
communication sequence; similarly, if agent B provided the final message, this 
"unanswered stimulus" could be dropped from the communication sequence. 

With this view of stimuli and responses, the ego memory for agent A is 
defined by 

2 



4b (^^)- 7 E ' (3) 



= 1,3,5,. 



for all r and s. In Eq. ([3]), we assume that the memory has an infinite capacity, 
able to exactly treat any number of stimulus-response pairs. A natural and 
desirable modification is to consider memories with a finite capacity. 
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Limited memory could be incorporated in a variety of ways. The approach 

(E) 

we take is to introduce a "forgetting" parameter A)j , with a value from the 
interval [0, 1], such that the ego memory is calculated as 

1 - * 

4'],(r,.) = 2— — f- Yl (4) 

1 (-^A ) 1=1,3,5,... 

for all r and s. The memory of a particular message transmission decays expo- 
nentially. 

The alter memory A^^g (r, s) is analogous to the ego memory, but with the 

roles of sender and receiver reversed. Thus, A^^^ (r, s) is the memory that 
agent A maintains about message exchanges with agent B, where agent A pro- 
vides the stimulus s to which agent B gives response r. The procedure for 
calculating the alter memory directly parallels that for the ego memory, except 
that the messages sent by agent A are now identified as the stimuli and the 
messages sent by agent B are now identified as the responses. The calculation 
otherwise proceeds as before, including making use of a forgetting parameter 
A^' for the alter memory. 

A useful symmetry often exists between the ego and alter memories of the 
agents. Using the alter memory ^J^^(r, s), agent A tracks the responses of 
agent B to stimuli from agent A. This is exactly what agent B tracks in its ego 
memory Sl^l^ (r,s). Thus, the memories of the two agents are related by 

A^*,^gir,s)=£l^\{r,s) . (5) 

However, Eq. ^ holds only if the agent memories are both infinite or A^'' — 
Xg \ The corresponding relation A^g\ (r, s) ~ s'^g (r, s) holds as well when the 

memories are both infinite or A^"* — A^-* . 

Besides the ego and alter memories, an agent has another memory called the 
response disposition. The response disposition Qa of agent A is, like the ego and 
alter memories, represented as a joint distribution, but there are many marked 
differences. Significantly, the response disposition is not associated with another 
particular agent, but instead shows what the agent brings to all interactions. In 
particular, the response disposition can act like a general data bank applicable 
to any other agent. Further, the response disposition changes more slowly. 
Instead, the update occurs after the interaction sequences. In this work, we 
hold the response dispositions fixed, so we defer discussion of a possible update 
rule for the response disposition to section [Ol 



3.3.2 Transition probability. 

The memories of an agent are combined to produce a transition probability, 
which is, in turn, used to randomly select message s. The form of t he transition 
probability is a generalization of that used bv .Dittrich et al.l (|2003|) . Differences 



arise from the differing goals of the two models: iDittrich et al.l consider the 
formation of social order, while wc are interested in strategies of interaction. 

The transition probability is constructed so as to deal with several distinct is- 
sues. Because of this, we do not develop the transition probability as an optimal 
selection rule according to one criterion, but rather as a compromise between 
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several conflicting factors arising in a situation of double contingency. We con- 
sider three such factors: (1) agent preference, (2) satisfaction of expectations, 
and (3) treatment of individual uncertainty. 

The first factor that we consider is agent preference. An agent tries to select 
messages that are correct for the situation, based on its prior experiences. The 
response disposition matches well with this notion, since, as discussed above, it 
is a relatively stable memory of the general agent behavior. 

The second factor that we consider is that an agent tries to satisfy the 
expectations that its interaction partner has about the agent itself. This is the 
expectation-expectation. Consider interactions between agents A and B. While 
selecting which message to send, agent A estimates what message agent B would 
expect to receive from agent A. In terms of the memories of agent A, this is 
expressed as the ego memory f^*^(r, s). Thus, what responses agent A has 
given to agent B in the past will be favored in the future, so that there is a 
continual pressure for an agent to be consistent in its responses. 

The third factor we consider is that an agent takes into account the level of 
certainty of the expected response from its interaction partner. By favoring mes- 
sages that lead to unpredictable responses, the agent can pursue "interesting," 
interactions of greater complexity. Alternatively, the agent may favor messages 
that lead to predictable responses, pursuing simpler (and more comprehensible) 
interactions. 

A natural approach is for agent A to calculate, based on previous responses 
of agent B, the uncertainty for each possible message r that it could send. This 
corresponds to calculating the entropy based on the conditional alter memory 
-^A^B I yielding 



n 



^AlB (• I Ol = - E ^AlB I log„, A% {r' I r) . (6) 



The base of the logarithm is traditionally taken as 2, measuring the entropy 
in bits, but in this work we take the base to be ns, the number of different 
messages. With this choice, the entropy take on values from the interval [0, 1], 
regardless of the number of messages. 

The uncertainty 7i -^^j^ (• \ f) is closely related to the expectation-certainty 

from Dittrich et al. I (|2003h . Note that all possible messages r are considered, 
so that agent A may send a message to agent B that is highly unlikely based 
on a just-received stimulus. Thus, we expect that resolving uncertainty and 
satisfying expectations will come into conflict. 

The transition probability is assembled from the foregoing components into 
a conditional probability distribution with form 



U^^l^ (r I .) (X c^^^Qa (r | s) + c^^^£f^^ [r \ s) + cf^n ^^^f^ (• | r) 

""{7) 

(A) (A) (A) 

The parameters c\ , , and Cg reflect the relative importance of the three 
factors discussed above, while c^^'^ is an offset that provides a randomizing 
element. The randomizing element provides a mechanism for, e.g., introducing 
novel messages or transitions. It also plays an important role on mathematical 
grounds (see the appendix). 



i(0 



ns 
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Note that we allow the parameter Cg"^-* to be negative. A positive Cg"^-* corre- 
sponds to agent A favoring messages that lead to unpredictable responses, while 
a negative c'^^ corresponds to agent A "inhibiting" messages that lead to un- 
predictable response, hence favoring messages leading to predictable responses. 
When Cg"^^ is negative, Eq. ([7]) could become negative. Since W^*^^ {r | s) is a 
probability, we deal with this negative-value case by setting it to zero whenever 
Eq. ([7]) gives a negative value. 

Clearly, one may expect different results in the outcome of interactions when 

(A) 

Cg is changed from a positive to a negative value. This is illustrated mathemat- 
ically in the appendix and numerically in section [4.21 We do, in fact, observe a 
sharp transition in the entropy of the alter and ego memories of the agent (mea- 
suring the average uncertainty in their response) near Cg"^'' = 0. Note however 
that the transition depends also on the other parameters and does not necessar- 

(A) 

ily occur exactly at c\ = 0. This is discussed in the appendix. The transition 



is reminiscent of what iDittrich et al.l call the appearance of social order, though 
our model is different, especially because of the response disposition which plays 
a crucial role in the transition. 

Appropriate choice of the co efficients ci^\ c^2^\ '^3^\ "^4^^ allows the 



transition probabilities used by IPittrich et al. to be constructed as a special 



case of Eq. ([7]). In particular, their parameters a, c/, and N are related to the 
ones used here by 



-1 

-3 



- (8) 

= 1 - a (9) 

= -a (10) 

^ ns(a + ^) . (11) 



Note finally that the transition probability given in Eq. ([7]) is not appropri- 
ate for selecting the initial message of an interaction sequence. However, the 
response disposition provides a useful mechanism for selection of the initial mes- 
sage. We calculate a marginal probability for the response r from the response 
disposition Qa (r, s), using 

Qa (r) = Y,Qa (r, s) . (12) 

s 

With the marginal probability of initial messages Qa (r) and the conditional 
probability for responses 1^a^\b I '^)' stochastic simulations of the model system 
are relatively straightforward to implement programmatically. 



4 Results. 

4.1 Summary of mathematical results. 

In this section we briefly summarize the mathematical results obtained in the 
paper. A detailed description of the mathematical setting is given in the ap- 
pendix. Supporting numerical results are presented in section [4?2l 
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The model evolution is a stochastic process where the probability for an 
agent to select a message is given by a transition matrix, as in a Markov pro- 
cess. However, the transition matrix is history-dependent and the corresponding 
process is not Markovian. The general mathematical fra mework for thi s type of 
process is called "chains with complete connections. " (|Maillardll200l A brief 
overview is given in the appendix. 

On mathematical grounds, a precise description of the pairwise interaction 
between agents requires answering at least the following questions: 

1. Does the evolution asymptotically lead to a stationary regime ? 

2. What is the convergence rate and what is the nature of the transients? 

3. How can we quantitatively characterize the asymptotic regime? 

We address these briefly below. Many other questions are, of course, possible. 

Does the evolution asymptotically lead to a stationary regime? In the model, 
convergence to a stationary state means that, after having exchanged sufficiently 
many symbols, the "perceptions" that each agent has of the other's behavior 
(alter memory) and of its own behavior (ego memory) have stabilized and do 
not evolve any further. The probability that agent A provides a given symbol as 
a response to some stimulus from agent B is not changed by further exchanges. 

Mathematically, the convergence to a stationary state is not straightfor- 
ward. One cannot a priori exclude non-convergent regimes, such as cycles, 
quasi-periodic behavior, chaotic or intermittent evolution, etc.. A cyclic evo- 
lution, for example, could correspond to a cyclothymic agent whose reactions 
vary periodically in time. More complex regimes would essentially correspond to 
agents whose reactions evolves in an unpredictable way. Though such behaviors 
are certainly interesting from a psychological point of view, they are not desir- 
able in a model intended to describe knowledge diffusion in a network of agents 
assumed to be relatively "stable." It is therefore desirable to identify conditions 
that ensure the convergence to a stationary state. These conditions are dis- 
cussed in the appendix. We also establish explicit equations for the memories 
in the stationary state. 

What is the convergence rate and what is the nature of the transients ? The 
answer to this question is as important as the previous one. Indeed, the investi- 
gated asymptotic behavior is based on the limit where interaction sequences are 
infinitely long, while in real situations they are obviously finite. However, if the 
characteristic time necessary for an agent to stabilize its behavior is significantly 
shorter than the typical duration of an interaction, one may consider that the 
asymptotic regime is reached and the corresponding results essentially apply. In 
the opposite case, the behavior is not stabilized and important changes in the 
stimulus-response probabilities may still occur. This transient regime is more 
difficult to handle and we have focused on the first case. The characteristic time 
for the transients depends on the parameters ci through C4. 

How can we quantitatively characterize the asymptotic regime? Addressing 
this question basically requires identifying a set of relevant observables — that 
is, a set of functions assigning to the current state of the two-agent system a 
number that corresponds to a rating of the interaction. We have obtained some 
mathematical and numerical results about the variations of the observables when 
the coefficients in Eq. ([7|) are changed. The most salient feature is the existence 
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of a sharp transition when C3 changes its sign. Simulations demonstrating the 
transition are presented in section 14.2.31 

We have chosen several observables. The entropy of a memory measures the 
uncertainty or unpredictability of the stimulus-response relationship. A mem- 
ory with high entropy is representative of complex (and, presumably, hard to 
understand) interaction sequences. We inspect the entropy of both the ego and 
alter memories. The distance or overlap between two memories measures how 
close are their representations of the stimulus-response relationship. Particu- 
larly interesting is the distance between the response disposition of an agent A 
and the alter memory another agent B maintains about agent A, which shows 
the discrepancy between the actual behavior of agent A and agent B's expecta- 
tion of that behavior. Note that this representation is necessarily biased since 
agent B selects the stimuli it uses to acquire its knowledge about agent A. This 
induces a bias (see appendix for a mathematical formulation of this) . 

To recap, in the appendix we provide a mathematical setting where we dis- 
cuss the convergence of the dynamics to a state where memories are stabilized. 
We obtain explicit equations for the asymptotic state and discuss how the so- 
lutions depend on the model parameters. In particular, we show that a drastic 
change occurs when C3 is near zero. We assign to the asymptotic state a set 
of observables measuring the quality of the interaction according to different 
criteria. 



4.2 Simulation results. 
4.2.1 Types of agents. 

A broad variety of agents can be realized from the definitions in section | 
The value used for the response disposition reflects an individualized aspect 
of an agent's behavior, while the various other parameters allow a wide range 
of strategies for interaction. In Eq. ()50p . we give an analytic expression for 
the memories in the asymptotic regime. However, an extensive study of the 
solutions of this equation, depending on six independent parameters, is beyond 
the scope of this paper. Instead, we focus in this section on a few situations 
with different types of agents, as characterized by their response dispositions. 

One type of agent has a random response disposition matrix, created by se- 
lecting each element uniformly from the interval [0, 1] and normalizing the total 
probability. With random response dispositions of this sort, the contribution 
to the agent behavior is complex but unique to each agent. A second type of 
agent has its behavior restricted to a subset of the possibilities, never producing 
certain messages. A response disposition for this type of agent, when presented 
as a matrix, can be expressed as a block structure, such as in Fig.[T] A third and 
final type of agent response disposition that we consider has a banded structure, 
as in Fig. [3 allowing the agent to transmit any message, but with restricted 
transitions between them. 

The parameters c\^\ ^^'^ '^4^^ ^-l^o pl^^Y important roles in the 

behavior of an agent A. We impose a constraint with the form 

c(^)+c(^)+4^)+ci-^) = l . (13) 

The constraint is in no way fundamental, but is useful for graphical presentation 
of simulation results. 
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4.2.2 Complementary knowledge. 

As an initial simulation, we consider interactions between two agents with block- 
structured response dispositions, as shown in Fig. [T] The number of messages 
is = 1 1 . The elements of the probability matrices are shown as circles whose 
sizes are proportional to the value of the matrix elements, with the sum of the 
values equal to one. 

The situation shown in Fig. [T] corresponds to a case of "complementary 
knowledge," where the agent behaviors are associated with distinct subdomains 
of the larger context. Each response disposition matrix has a 7 x 7 block where 
the elements have uniform values of 1/49, while all other elements are zero. 
The blocks for the two agents have a non-zero overlap. The subdomain for 
agent A deals with messages 1 through 7, while that for agent B deals with 
messages 5 through 11. Thus, if agent B provides a stimulus from {5,6,7}, 
agent A responds with a message from {1, 2, . . . , 7} with equiprobability. The 
coefficients c"^'' and c"^'' are both set to 0.01. These terms are useful to handle 
the response of the agent when the stimulus is "unknown" (i.e., it is not in the 
appropriate block). In such a case, the agent responds to any stimulus message 
with equal likelihood. 

Interaction between the two agents consists of a large number of short in- 
teraction sequences. We study the evolution of the alter and ego memories of 
each agent as the number of completed interaction sequences increases. Each 
interaction sequence consists of the exchange of 11 messages between the two 
agents, with the agents taking part in 128000 successive interaction sequences. 
For each interaction sequence, one of the two agents is chosen to select the initial 
message at random, independently of how previous sequences were begun (ran- 
domly choosing this "starting agent" is intended to avoid pathological behavior 
leading to non-generic sequences). 

We first checked the convergence of S'^^ (r) to the first mode of i^A\B^B\A 
(see section rSTSj) . We measure the distance between these two vectors at the end 
of the simulation. The distance tends to zero when the number of interaction 
sequences T increases, but it decreases slowly, like 1/T (as expected from the 
central limit theorem). An example is presented in Fig.[2]for c^"^"* = c^^'' = 0.99 
and 4^)=cf)=4^)=cf)=0. 

We next investigated four extremal cases 

0;ci^^ = 0.01), (4^^ = 0;4^' ^ 0.99; = 0; 4^^ = 0.01), (4^' = 0;4^^ = 

0;4^^ = 0.99; 4^^ = 0.01), and (4^^ = 0.99; 4^^ = 0.99; 4^^ = -0.99; 4^' = 
0.01). For each case, we first examined the convergence to the asymptotic 
regime. We show the evolution of the joint entropy in Fig. [21 This allows us to 
estimate the time needed to reach the asymptotic regime. 

Next, we plotted the asymptotic values of the ego and alter memory of 
agent A, averaged over several samples. They are shown in Fig. H] for A'-'^-' = 
A(^) = 1. InFig.El we show the values with a finite memory (A(^) = A(^) = 0.99, 
corresponding to a characteristic time scale of approximately 100 steps). Our 
main observations are: 

1. c[^^ = 0.99; 4^^^ = 0;c^^^ = 0;4^^ = 0.01. The responses of each agent 
are essentially determined by its response disposition. The asymptotic 
behavior is given by 'P^^'' and , the first modes of the matrices QaQb 



15 



and QbQa, respectively. Fig.[2]shows indeed the convergence to this state. 
The asymptotic values of the alter and ego memories are given by QaP'^^^ 
and Qb'P'^ab "^^e memory matrices have a "blockwise" structm^e with a 
block of maximal probability corresponding to the stimuli known by both 
agents. 

2. cf^ = Q;c^^^ = 0.99; c^-^^ = 0;ci'^^ = 0.01. The response of each agent 
is essentially determined by its ego memory. As expected, the maximal 
variance is observed for this case. Indeed, there are long transient corre- 
sponding to metastable states and there are many such states. 

3. c^^^ = 0; C2^^ = 0; = 0.99; €4^^^ = 0.01. The response of each agent is 
essentially determined by its alter memory, and the probability is higher 
for selecting messages that induce responses with high uncertainty. Again, 
the asymptotic state is the state of maximal entropy. 

4. c[^'^ = 0.99; c^^^ = 0.99; 4"^' = -0.99; c^^^ = 0.01. The response of each 
agent is essentially determined by its alter memory, but the probability is 
lower for selecting messages that induce responses with high uncertainty. 
The asymptotic memories have an interesting structure. The alter and 
ego memories have block structures like that of both agents' response 
dispositions, but translated in the set of messages exchanged (contrast 
Fig. [1] and the bottom row of Fig. [5]). The effect is clear if we keep in 
mind that, for agent ^'s ego memory, the stimulus is always given by 
agent _B, while for agent A's alter memory, the stimulus is always given 
by agent A. 

Finally, we note that, in this example, the asymptotic alter memory does 
not become identical to the other agent's response disposition. Indeed, the 
perception that agent A has from agent B is biased by the fact that she chooses 
always the stimuli according to her own preferences (described in the appendix 
in more mathematical terms) . A prominent illustration of this is given in Fig. [S] 
(bottom row). 

These simulations shows how the final (asymptotic) outcome of the interac- 
tion may differ when the parameters Cj are set to different, extremal values. The 
role of the forgetting parameter is also important. In some sense, the asymptotic 
matrices whose structure are the closest to the block structure of the response 
dispositions are the matrices in the bottom row of Fig. [51 where the agent are 
able to "forget." There is a higher contrast between the blocks seen in the 
corresponding row of Fig. [2 when A^-^) = A(^) = 1. 

4.2.3 Behavior dependence on the parameters ci, 02,03,04. 

Based on theoretical (section [A. 4p and numerical (section [4. 2. 2p considerations, 
one expects changes in the asymptotic regime as the parameters c-^^ and c^-^'' 
vary. Note, however, that the general situation where agents A and B have 
different coefficients requires investigations in an 8 dimensional space. Even if 
we impose constraints of the form in Eq. (113p . this only reduces the parameter 
space to 6 dimensions. A full investigation of the space of parameters is therefore 
beyond the scope of this paper and will be done in a separate work. To produce 
a manageable parameter space, we focus here on the situation where agents A 
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and B have identical coefficients and where C4 is fixed to a smah value (0.01). 
The number of messages exchanged by the agents is ns — 11. 

To explore the reduced, C1C2 space, we simulate interaction with response 
dispositions of both agents selected randomly (see section l¥. 2. II for details). We 
conduct 10000 successive interaction sequences of length 11 for each pair of 
agents. In each sequence, the agent that transmit the first message is selected 
at random. Therefore we have a total exchange of 110000 messages. 

We compute the asymptotic value of the joint entropy of the memories. We 
average the entropy over 100 realizations of the response dispositions. This 
allows us to present, e.g., the entropy of the alter memory as a function of ci 
and C2. 

In Fig. [51 we show the joint entropy of the alter memory (the joint entropy of 
the ego memory is similar) . The parameters ci and C2 both vary in the interval 
[0,0.99]. Consequently, C3 G [—0.99 : 0.99]. There is an abrupt change in the 
entropy close to the line ci + C2 = 0.01 (C3 ~ 0). The transition does not occur 
exactly at said line, but depends on ci and C2. 

The surface in Fig. [H] shows a drastic change in the unpredictability of the 
pairs, and thus in the complexity of interactions, when crossing a critical line 
in the C1C2 parameter space. Note, however, that the transition is not precisely 
at C3 = 0, but rather to a more complex line as discussed in the appendix. 



4.2.4 Rates of change. 

While the asymptotically stable states of the memories are useful in understand- 
ing interaction, they are not the only important factors. Additionally, the rates 
at which the memories change can be important. In particular, an inferior result 
quickly obtained may be more desirable than a superior result reached later. 

To investigate rates, we simulate two distinct types of agents exchanging 
ns = 5 messages. The first type of agent is a "teacher" agent A that has 
relatively simple behavior. Its response disposition has a banded structure, 
shown in Fig. [71 The teacher agent has a fixed behavior dependent only upon 
its response disposition {c^2^^ = Cg"^' = c'"^^ = 0). The values of A^' and A^'' 
are irrelevant, since the alter and ego memories contribute nothing. 

The second type is a "student" agent B that has more general behavior. The 
student agent has a random response disposition, as described in section [4.2.11 
In general, the student agent uses all of its memories to determine the transition 
probability, and we vary the values of c^^\ c^r^\ c^^\ and c^^'' to investigate 
the effects of the various parameters. We take the memories to be infinite. 

To assess the simulations, we focus on how rapidly the student agent is able 
to capture the behavior of the teacher agent in its memories. It is natural 
to determine this using the difference between the response disposition of the 
teacher agent and the alter memory of the student agent. We measure the 

difference using the distance d (yQA,A^BA^ ' defined by 

d^{QA.A^L)^\Y.{QA{r.s)-A%\{r,s)) . (14) 

r.s 

The value of d (QaiA^j^A always lies in the interval [0, 1]. 



17 



The distance defined by Eq. (|14p requires memories from both agents, and 
thus cannot actually be calculated by either agent. However, by using the 
identity given by Eq. (O, we see that cP (^QA,Ag\^ — cP (^Qaj^ab)' which 
depends only on memories of agent A and is therefore in principle available 
to agent A. This reveals an interesting asymmetry in the system; the teacher 
agent A has an ability to assess the interaction that the student agent B lacks. 

A simple way to determine a rate of change from the memories is to base 
it on the number of message exchanges it takes for the distance to first become 
smaller than some target distance. If N messages are exchanged before the 
target distance is reached, the rate is simply 1/iV. The simulated interaction 
can be limited to a finite number of exchanges with the rate set to zero if the 
target is not reached during the simulation. 

In Fig. [51 we show the rates for different distances, averaged over 50 samples. 



In Fig. 8(a) the rate determined from a target distance of 0.1 is shown. For this 



case, the rates have a simple structure, generally proceeding more rapidly with 
Cg^^ < 0. In contrast, Fig. [8(b)] presents quite different results based on a target 
distance of 0.05, with more complex structure to the rates. In this latter case, 
the highest rates occur with Cg > 0. We note that inclusion of the response 
disposition is crucial in this scenario. In particular, the rate of change goes to 
zero if the student agent suppresses its response disposition. 



5 Outlook. 

5.1 Assessment of interactions. 

In section !!}. 3. H we presented rules for updating the ego and alter memories of an 
agent. These memories are updated during the course of interactions in which 
the agent is involved. In contrast, the response disposition is held fixed during 
interaction. In this section, we present a scheme for update of the response 
disposition, occurring intermittently at the end of interaction sequenceso 

Thus far, the response disposition Qa {r, s) has been presented as time in- 
dependent. To generalize this, we number the interaction sequences and add 
an index to the response disposition to indicate the sequence number, giving 

Not all interactions are successful. For a two-party interaction, it is possible 
that either or both of the parties will gain nothing, or that both benefit. We 
can incorporate this into the model by introducing an assessment process for 
allowing updates to the response dispositions of the agents. 

To begin the assessment process, we can first calculate a joint probability 
distribution representative of the message sequence that was constructed during 
the interaction. Assuming the message sequence consists of k messages labeled 
mi, m2, . . . , TOfe_i, nik, we find J-^ , the empirical frequency of message pairs for 
interaction sequence T, using 

^For long interaction sequences, the response disposition could be updated periodically — 
but less frequently than the ego and alter memories — during the sequence. 
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for all r and s. Note that Eq. (|15p utilizes all of the messages exchanged, 
regardless of whether the agent ego and alter memories have infinite memories 
(i.e., A^^ = A^^ = 1) or undergo "forgetting" (i.e., A^^^ < 1 or A^^^ < 1). This 
is due to an assumed period of reflection in which details of the interaction can 
be considered at greater length. 

To actually make the comparison, we find the distance between the behavior 
expressed in the interaction sequence (using T^) and that expressed by the 
response disposition of the agents. A small distance is indicative of relevance 
or usefulness of the interaction sequence to the agent, while a great distance 
suggests that the interaction sequence is, e.g., purely speculative or impractical. 

Focusing on agent A (a similar procedure is followed for agent B), we calcu- 
late the distance d (^^^ ^ Qa^^^ using 

d'{:F-,QT)-lT.{^''i-^^)-QTir,s)y . (16) 

r.s 

Since and Q^j["* are both probability distributions, the distance d (y^'^ ■> 2a"^) 

must lie in the interval [0, 1] The value of the distance must be below an accep- 
tance threshold r]A that reflects the absorbative capacity of agent A, limiting 
what interaction sequences agent A accepts. 

The above considerations are used to define an update rule for the response 
disposition. Formally, this is 

QT^{l-kA)QT''+kAT^^' , (17) 

where the update rate kA is in the interval [0, 1]. The update rule in Eq. (fTT]) 
is applied if and only if d {j^^ ^ Q'a^^ <^ 

5.2 Communities of agents. 

In this work, we have focused on interactions between pairs of agents. The 
agent definitions are posed in a manner that generalizes readily to larger groups 
of agents. However, some extensions to the agents described in section 13.31 
are needed, in order to introduce rules by which agents select communication 
partners. 

A reasonable approach is to base rules for selecting interaction partners on 
affinities that the agents have for each other. Letting Qxy be the affinity that 
agent X has for agent Y, we set the probability that agent X and agent Y 
interact to be proportional to QxyGyx (the product is appropriate because 
the affinities need not be symmetric). A pair of interacting agents can then be 
chosen based on the combined affinities of all the agents present in the system. 

As a starting point, the affinities can be assigned predetermined values, 
defining a fixed network of agents. More interestingly, the affinities can be 
determined based on the agent memories, so that interaction produces a dy- 
namically changing network. A natural approach to take is to compare the 
behavior of an agent A with its expectation of the behavior of another agent B, 
as represented by the alter memory For example, agents can be con- 

structed to predominately interact with other agents they assess as similar by 
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defining 



where fc is a settable parameter and d is the distance defined in Eq. p4|) . The 
affinities can be viewed as the weights in a graph describing the agent network. 

The outcomes of interactions wih depend not only on agent preferences and 
strategies, but also on the patterns of relations — encoded by the affinity graph — 
within which the agents are embedded. The structure of the network may play 
an important role in influencing the global spread of information or facilitating 
collective action. The interplay between network architecture and dynamical 
consequences is not straightforward, since the properties of the graphs that are 
relevant to individual and collective behavior of the agents will depend on the 
nature of the dynamical process describing th e interactions. This pro cess can be 
related to prior studies of contagion (see, e.g.. lBlanchard et al. However, 



unlike classical epidemiological models, which assume spread of contagion to be 
a memory-free process, agents are affected by both past and present interactions 
amongst one another. The probability of "infection" can exhibit threshold-like 
behavior, with the probability of adopting some behavioral pattern changing 
suddenly after some finite number of exposures. 



6 Summary 

In this work, we have introduced an agent-based model of interaction. The 
model draws on concep ts from Luhmann's contingen cy approach, and builds 
on an earlier model by IPittrich. Kron. and B anzhaA Agent interactions are 
formulated as the exchange of distinct messages that are selected based on 
individual agent properties and the interaction history. 

We expressed the model mathematically, in a form suitable for handling an 
arbitrary number of agents. Agent memories are formulated using a number of 
probability distributions. Two types of memories, th e ego and alter memories, 
are similar to the memories used by IPittrich et al. , but we also introduced a 



response disposition showing individual preferences for the agents. A selection 
probability for messages is calculated from the memories, along with a random- 
izing term allowing for invention. 

The selection probabilities are used to determine which messages the agents 
exchange, but the selection probabilities themselves depend on the ego and alter 
memories, which record the messages exchanged. To analyze the properties of 
the model, we focused on pairs of agents. Using analytic methods and computa- 
tional simulations, we explored the asymptotic properties of the agent memories 
and selection probability as the number of messages exchanged becomes very 
large, identifying some conditions for the existence and uniqueness of stable 
asymptotic solutions. Additionally, we investigated numerically the rates of 
change in shorter interaction sequences. 

Finally, we sketched out intended extensions of the model. We described how 
the response dispositions of the agents might be updated and suggested how 
to handle larger groups of agents. In particular, these two extensions appear 
sufficient for investigating how inventions can be either adopted or rejected by 
the community of agents. Since an agent uses its response disposition with 
any of the other agents, the response disposition serves as a sort of general 
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behavioral preference, in the form of messages and transitions between messages. 
By tracking changes to the response disposition, we can distinguish between 
successful and unsuccessful inventions. In networks of agents, this approach 
would allow us to investigate how various network structures might facilitate or 
hinder the spread of inventions. This will be the subject of future research. 
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A Mathematical analysis. 

In this appendix, we develop the mathematical aspects summarized in sec- 
tion 14.11 The formalism presented here deals with the two-agent case, but 
some aspects generalize to the multi-agent case. We are mainly interested in 
the asymptotic behavior of the model, where the interaction between the two 
agents is infinitely long. Though this limit is not reached in concrete situations, 
it can give us some idea of the system behavior for sufficiently long sequences. 
The asymptotic results essentially apply to the finite case, provided that the 
length of the sequence is longer than the largest characteristic time for the 
transients (given by the spectral gap; see section EH . 

A.l Basic statements. 

Recall that W^*^^ {r \ s) is the conditional probability that agent A selects mes- 
sage r at time t given that agent B selected s at time t — 1. It is given by 



Eq. O, 




A^'^Qa (r |.)-f 4-^)4*/^ (r|.)-fcl 



■^^n (. I .) -f 




(19) 



where TL ^^i^ (• | r) is 



ns 

n [a%^ (• k)] = - E -^AlB (r I r) log„, {a^^I ir' \ r)) . (20) 



r' = l 



Note that this term does not depend on s. The normalization factor M^-g is 
given by 
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and does not depend on time when c^^^ = 0. Moreover, 



l-4-'^<A/l|^s<l + 4-'^("s-l) . (22) 
It wiU be useful in the following to write l^^^^g {r \ s) in matrix form, so that 



U 



A\B .At) 
■'^ A\B 



with a corresponding equation for the U^j^. 
conditional probability, with the form 



1 



1 



S = — 

ns 



MB 



(23) 



The matrix S is the uniform 



(24) 



We use the notation H 



jAt) 



for the matrix with the r, s element given by 



■^A\B 



For simplicity, we assume here that agent B always starts the interaction. 
This means that agent B selects the message at odd times t and agent A selects 
at even times. When it does not harm the clarity of the expressions, we will drop 
the subscripts labeling the agents from the transition probabilities and call U'*^ 
the transition matrix at time with the convention that U^^"-* = U^i'^ and 

U(2n+i) ^ XJ^'^^"*. Where possible, we will further simplify the presentation by 
focusing on just one of the agents, with the argument for the other determined 
by a straightforward exchange of the roles of the agents. 

A. 2 Dynamics. 

Call ■p'^*^ (r) the probability that the relevant agent selects message r at time 
t. The evolution of P^*) (r) is determined by a one parameter family of (time 
dependent) transition matrices, like in Markov chains. However, the evolution 
is not Markovian since the transition matrices depend on the entire past history 
(see section IX3|) . 

Denote by m — mi, TO2, • • ■ , n^t-i, fnt, ... an infinite message exchange se- 
quence where mt is the t-th exchanged symbol. The probability that the finite 
subsequence mi, m2, . . . , rrit-i, mt has been selected is given by 



Prob [mt, m(_i, . . . ,TO2,mi] = 

ll^''> (mt I mt_i)Zi(*-i) {mt-i \ m^.a) 



.W^^^ (to2 I mi)p(^^ (mi) (25) 



Thus, 



p(t) ^ u(*)u(*"^)u(*"^) • . .u(2)p(^' 



(26) 



Since agent B selects the initial message, we have V'^^^ (r) = X]l=i 2b {r, s). As 
well, the joint probability of the sequential messages mt and m(_i is 



Prob[mt,mt_i] =W'*)(mt | mt-i)r'^*-^^ (mt-i) 



(27) 
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Note that the U*^*) depend on the response dispositions. Therefore one can at 
most expect statistical statements referring to some probabihty measure on the 
space of response disposition matrices (recall that the selection probability for 
the first message is also determined by the response disposition). We therefore 
need to consider statistical averages to characterize the typical behavior of the 
model for fixed values of the coefficients c|'^'' and c\^^ . 

A. 3 Existence and uniqueness of a stationary state. 

In this section, we briefly discuss the asymptotic properties of the ME model 
considered as a stochastic process. The main objective is to show the existence 
and uniqueness of a stationary state, given that the probability for a given 
message to be selected at time t depends on the entire past (via the ego and 
alter memories). In the case where €2 = 03 = 0, the ME model is basically a 
Markov process and it can be handled with the standard results in the field. 

In the general case, one has first to construct the probability on the set of 
infinite trajectories where A — {1,2,... 715} is the set of messages. The 
corresponding sigma algebra J- is constructed as usual with the cylinder sets. 
In the ME model the transition probabilities corresponding to a given agent 
are determined via the knowledge of the sequence of message pairs (second or- 
der approach), but the following construction holds for an n-th order approach, 
n > 1, where the transition probabilities depends on a sequence of n-tuples. 
If mi, TO2, . . . , mt-i, nif is a message sequence, we denote hy lui,uj2, ■ ■ - ^^k the 
sequence of pairs uji = (mi, TO2), ^2 = {ms, 7114), ... uJk = i'm2k-i,'m2k)- De- 
note by oj™ = {wi . . .uJm)-, where m > I. We construct a family of conditional 
probabilities given by 

P [ujt = (r,s)|w^i] = ciQ(r,s)+C2f [uJt = (r, s)|w*-i] +C3F (A [• | (r)+C4 

(28) 

for t > 1, where 

1 

£ [ujt - (r, s)\lo\-^] =-Y. ^(^fe = ('^' ' (29) 

k=l 

with x() being the indicatrix function {A [■ | lo\~^\ has the same form, see sec- 
tion [3Xl|). For i = 1 the initial pair wi is drawn using the response disposition 
as described in the text. Call the corresponding initial probability /i. 
We have 

p KT+iki] = P K+iK] P ...P (30) 

and, Vm, ^ > 1, 

p K'^Vi] p Krvr'] = p ■ (31) 

From this relation, we can define a probability on the cylinders {ij-'™"'"'} by 

^['^™+'] =E^Kn+'MM^i) • (32) 
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This measure extends then on the space of trajectories (^A^ , T) by Kolmogorov's 
theorem. It depends on the probabihty of choice for the first symbol (hence it 
depends on the response disposition of the starting agent). A stationary state 
is then a shift invariant measure on the set of infinite sequences. We have not 
yet been able to find rigorous conditions ensuring the existence of a stationary 
state, but some arguments are given below. In the following, we assume this 
existence. 

Uniqueness is expected from standard ergodic argument whenever all tran- 
sitions are permitted. This is the case when both C4 > and C3 > (the case 
where C3 is negative is trickier and is discussed below). When the invariant state 
is unique, it is ergodic and in this case the empirical frequencies corresponding 
to alter and ego memories converge (almost surely) to a limit corresponding 
to the marginal probability obtained from Eq. (jS^ . Thus, the transition 
matrices U^*^ also converge to a limit. 

In our model, uniqueness basically implies that the asymptotic behavior of 
the agent does not depend on the past. This convergence can be interpreted as 
a stabilization of the behavior of each agent when interacting with the other. 
After a sufficiently long exchange of messages, each agent has a representation 
of the other that does not evolve in time (alter memory). It may evaluate the 
probability of all possible reactions to the possible stimuli, and this probability 
is not modified by further exchanges. Its own behavior, and the corresponding 
representation (ego memory), is similarly fixed. 

In the next section we derive explicit solutions for the asymptotic alter and 
ego memories. We are able to identify several regions in the parameters space 
where the solution is unique. But it is not excluded that several solutions exist 
in other regions, especially for C3 < 0. In this case, the long time behavior of 
the agent depends on the initial conditions. A thorough investigation of this 
point is, however, beyond the scope of this paper. 

Note that the kind of processes encountered in the ME model has interestinj 
relations with the so-called "chains with complete connections" (see [Maillar 



20031) for a comprehensive introduction and detailed bibliography). There is in 



particular an interesting relation between chains with complete conne ctions and 
the Dobrushin-Landford-RucUc construction of Gibbs measures ()Maillard, 20031 ). 
Note, however, that the transition probabilities in our model do not obey the 
continuity property usual in the context of chains with complete connections. 



A. 4 Transients and asymptotics. 

The time evolution and the asymptotic properties of Eq. (|26p are determined 
by the spectral properties of the matrix product U'^*^U(*^^)U'^*~^'' . . . U^^-'. As- 
sume that the U*-*^ converge to a limit. Then, since the U^*^ are conditional 
probabilities, there exists, by the Perron-Frobenius theorem, an eigenvector as- 
sociated with an eigenvalue 1, corresponding to a stationary state (the states 
may be different for odd and even times) . The stationary state is not necessarily 
unique. It is unique if there exist a time such that for all t > to the matrices 
U^*^ are recurrent and aperiodic (ergodic). For C3 > and C4 > 0, ergodicity 
is assured by the presence of the matrix S, which allows transitions from any 
message to any other message. This means that for positive C3 the asymptotic 
behavior of the agents does not depend on the history (but the transients depend 
on it, and they can be very long). 
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For C3 < 0, it is possible that some transitions allowed by the matrix S are 
canceled and that the transition matrix U^*-* loses the recurrence property for 
sufficiently large t. In such a case, it is not even guaranteed that a stationary 
regime exists. 

The convergence rate of the process is determined by the spectrum of the 
U^*^ and especially by the spectral gap (distance between the largest eigenvalue 
of one and the second largest eigenvalue). Consequently, studying the statistical 
properties of the evolution for the spectrum of the U'*' provides information 
such as the rate at which agent A has stabilized its behavior when interacting 
with agent B. 

The uncertainty term and the noise term play particular roles in the evolu- 
tion determined by Eq. (|23p . First, the matrix S entering in the noise term has 
eigenvalue with multiplicity ns — 1 and eigenvalue 1 with multiplicity 1. The 
eigenvector corresponding to the latter eigenvalue is 



u 



1 

ns 



(33) 



V 1 / 



which is the uniform probability vector, corresponding to maximal entropy. 
Consequently, S is a projector onto u. 

Second, the uncertainty term does not depend on the stimulus s. It cor- 
responds therefore to a matrix where all the entries in a row are equal. More 



precisely, set a 



it) _ 



n 



•^aIb i' \ ^) ■ Then one can write the corresponding 



matrix in the form nsv'-*-'u'^, where v^*) is the vector 



J*) 



(34) 



and u""" is the transpose of u. It follows that the uncertainty term has a 



eigenvalue with multiplicity n — I and an eigenvalue X^T^i ^ "^aib (' I 



with 



corresponding eigenvector v*- •'. It is also apparent that, for any probability 
vector P, we have H A^l^ 



(t Tn 

= nsvyuP = V 
The action of U^*' on P^*) is then given by 



(t) 



>(t+i) _ 



U 



it) p(t) 



M) 



^(A)g(t) 



,(t) 



M)^w 



(35) 



The expression in Eq. warrants several remarks. Recall that all the vectors 
above have positive entries. Therefore the noise term c'/^'^u tends to "push" P^'* 
in the direction of the vector of maximal entropy, with the effect of increasing 
the entropy whatever the initial probability and the value of the coefficients. 
The uncertainty term Cg'^^'v^^ plays a somewhat similar role in the sense that it 
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also has its image on a particular vector. However, this vector is not static, in- 
stead depending on the evolution via the alter memory. Further, the coefficient 
Cg"^^ may have either a positive or a negative value. A positive Cg"^^ increases 
the contribution of v^^ but a negative Cg'^^ decreases the contribution. Conse- 
quently, we expect drastic changes in the model evolution when we change the 
sign of c'^^ — see section and especially Fig. E] for a striking demonstration 
of these changes. 



A. 5 Equations of the stationary state. 

The influence of the coefficients c^^^ and c\^^ is easier to handle when an asymp- 
totic state, not necessarily unique and possibly sample dependent, is reached. In 
this case, we must still distinguish between odd times (agent B active) and even 
times (agent A active), that is, P^*-* has two accumulation points depending on 
whether t is odd or even. Call v''^^ (m) the probability that, in the stationary 
regime, agent A selects the message m during interaction with agent B. In the 
same way, call V'^^ (r, s) the asymptotic joint probability that agent A responds 
with message r to a stimulus message s from agent B. Note that, in general, 
■Pa^ ir, s) ^ Vi°^\j-, s), but rlr^ (mH Es ^A^J (m, s) = V'-^J (r, m) and 
(m) = Eg 'P^BA ('^' = X^r '^AB since each message simultaneous 

is both a stimulus and a response. 

Since alter and ego memories are empirical frequencies of stimulus-response 
pairs, the convergence to a stationary state implies that ^) converges 

to a limit (r, s) which is precisely the asymptotic probability of stimulus- 
response pairs. Therefore, we have 



(36) 
(37) 



Thus, the u'^,'!] converge to a limit U\^^, where 



(oo) 



u 



(oo) 
A\B 



■^^A\B 



I Ci ^4a + C2 rj 



(oo) 

a\b 



,(oo) 
^A\B 



(38) 



From Eq. (gT]), we have V 



(s). Hence, 



tt(°o)tj(°°) 

■py^l _ tt(°o)tj(oo) 
— '-^BIA^A 



,(oo) 
A 

,(oo) 



(39) 
(40) 



and 



,(oo) _ ^y(oo) jj(oo)|-,(oo) 

A - 

,(oo) _ -jy(oo)|.^(oo)|-,(oo) 

B — ^ B\A^ A\b'^ B 



(41) 
(42) 



It follows that the asymptotic probability is an eigenvector of UJ^U^^ 

corresponding to the eigenvalue 1. We will call this eigenvector the first mode 
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of the corresponding matrix. Therefore, the marginal ego memory of agent A 
converges to the first mode of U^^U^^. A numerical example is provided in 

section mnm 



Combining Eqs. ([271) and ([361), V^^^ (r, s) = U"^^ (r \ s) V)^' (s) = S"^' (r, s). 



(oo) 



,(oo) 



,(oo) 



But (5) 



(r,^) = E.4B^ {r,s), SO 



£ 



(00) 



<i ir I 



Therefore, using the relation {r \ s) = ('^ I we have 



(43) 
(44) 



Q^(r|,s) + 4^'4°°i(r^s)+cr7^ 



Qs (r I s) 



4Ti (• I 



4Ti (• I 



(^) 



(45) 



(46) 



A. 6 Solutions of the stationary equations. 

Define 



(A) 



With this definition, Eqs. (gS]) and become: 



4ri 1 ^) 



4Ti (• I o_ 

4ri (• 1 0' 



(47) 

(48) 
(49) 



We next plug Eq. (^5]) into Eq. 



After some manipulation, we obtain 



17(00) 



{r\s) 



PrQA {r I s) 



(B) 



p'Pn £^^^ (. I /) 



/3rE(^i 2B(r'|r)+/3f))log„, 1 



(00) 



/Sp^Qs (r' |r)+/3f) 



4ri (• I r') 



^A\B 



log,, 



/3f (• I r') 



M^'^Qs (r' |r)+/3f^ 



(50) 



which uncouples the expression for 4^ ('^ I ^) that for 4^ I 

some sense, Eq. (ISO]) provides a solution of the model with two agents, since it 
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captures the asymptotic behavior of the ego and alter memories (provided the 
stationary regime exists). However, the solution to Eq. 150]) is difficult to obtain 
for the general case and depends on all the parameters c-^-* and cp-*. Below, 
we discuss a few specific situations. 

(A) 

This form has the advantage that it accommodates series expansion in Cg . 
However, /Jg^-* depends on via the normalization factor from Eq. (pij) and 
high order terms in the expansion are tricky to obtain. Despite this, the bounds 
given in Eq 

us to characterize the behavior of the model when c]^^' changes its sign. This is 
of principle importance, since we shall see that a transition occurs near C3 — 0. 
It is clear that the first term in Eq. ([5D]) is of order zero in /3^^^ , the second term 
is of order one, and the remaining terms are of higher order. 



(A) (A) 

ensure that /Jg is small whenever Cg is small. This allows 



When c' 



(A) 



0, agent A does not use its alter memory in response selection. 



Its asymptotic ego memory is a simple function of its response disposition, with 
the form 

M^^QA(r + . (51) 



^A\B 

ovv,oU 



Sir^ (r I s) 



When Cg is small, o^^^ {r \ s) becomes a nonlinear function of the conditional 
response disposition for agent B, so that 



4Ti (H = Pi^^ Qa {r\s)+ /3f ) + /3f [/3i^) Qb (■ | r) + /3f (52) 



An explicit, unique solution exists in this case. 



For small values of c 
is proportional to Ti. 



(A) 
3 



the derivative of f^^e I ^) '^ith respect to c 



(A) 



Note that the derivative depends 



nonlinearly on the coefficients c\ and c\ , and that, therefore, the level lines 



n 



— C depend on c^^'' and c)^' (see Fig. [5] where the 



levels lines are drawn — they do not coincide with C3 = 0). This slope can be 
steep if the uncertainty in agent iJ's response disposition is high. For example, 



if 



^ is small, the entropy is high. There exists therefore a transition, possibly 
0. 

(A) 



sharp, near Cg'^'' 



As further increases, we must deal with a more complex, nonlinear 
equation for the ego memory. In the general case, several solutions may exist. 

A simple case corresponds to having c^^^ — Cg^-* — 1. Indeed, in this case 
we have 



P^Pn £^7^ (. I r') 



4ri 1 ^) 



''A\B 



(• I r') 



(~) 



A\B 



(53)' 

The right hand side is therefore independent of r and s. It is thus constant and 
corresponds to a uniform ('^ I *). Hence, using Eq. the asymptotic 

selection probability is also uniform and the asymptotic marginal probability 
of messages 'Pjj°°^ (r) is the uniform probability distribution, as described in 



section [X31 Consistent with the choice of the coefficients, Vj^ 
entropy. 



{00) 



(r) has maximal 
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Next, note that if C2 is large (but strictly lower than one), convergence to 
the stationary state is mainly dominated by the ego memory. However, since 
the ego memory is based on actual message selections, the early steps of the 
interaction, when few messages have been exchanged and many of the S'^^ (r, s) 
are zero, are driven by the response disposition and by the noise term. However, 
as soon as the stimulus-response pair (r, s) has occurred once, the transition 
probability lyl'^^Q (r \ s) will be dominated by the ego term U^^^^g {r \ s) and the 
agent will tend to reproduce the previous answer. The noise term allows the 
system to escape periodic cycles generated by the ego memory, but the time 
required to reach the asymptotic state can be very long. In practical terms, 
this means that when C4 is small and C2 is large for each of the two agents, the 
interaction will correspond to metastability, with limit cycles occurring on long 
times. Also, though there exists a unique asymptotic state as soon as C4 > 0, 
there may exist a large number of distinct metastable state. Thus, when C2 
is large for both agents, we expect an effective (that is on the time scale of a 
typical interaction) ergodicity breaking with a wide variety of metastable states. 

To summarize, the roles of the various coefficients in Eq. (fT9|) are: 

• ci emphasizes the role of the response disposition. When ci is nearly one 
for both agents, the asymptotic behavior is determined by the spectral 
properties of the product of the conditional response disposition. 

• C2 enhances the tendency of an agent to reproduce its previous responses. 
It has a strong influence on the transients and when C2 is large, many 
metastable states may be present. 

• C3 drives the agent to either pursue or avoid uncertainty. A positive C3 
favors the increase of entropy, while a negative C3 penalizes responses 
increasing the entropy. 

• C4 is a noise term ensuring ergodicity, even though the characteristic time 
needed to reach stationarity (measured using, e.g., the spectral gap in the 
asymptotic selection probability matrices) can be very long. 
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(a) agent A 




(b) agent B 



Figure 1: Block-structured response dispositions. The size of the circle is pro- 
portional to the value of the corresponding entry. 
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Figure 2: Evolution of the distance between the vector f^*-* (r) and the first 

t(oo)tt(oo) 

'aib^bia 



mode of \J^?^'l\j'\^l as the number of interaction sequences increases. 
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Figure 3: Evolution of the joint entropy of alter and ego memory with infinite 
(A(-^) = A(^) = 1) and finite (A^-^) = A^^^ = 0.99) memories. 
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Alter 

Cj=0.99; c,=0; 0^=0; c^=0.01 



Ego 

Cj=0.99; 0^=0; c^=0; c=Qm 




Figure 4: Average asymptotic memories for agent A after 128000 steps, with 
infinite memories. The matrices shown here are calculated by averaging over 
10 initial conditions. The sizes of the red circles are proportional to the corre- 
sponding matrix elements, while the sizes of the blue squares are proportional 
to the mean square deviations. 
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Alter 
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Figure 5: Average asymptotic memories for agent A after 128000 steps, with 
finite memories (A^'^-' = A^^-' = 0.99). The matrices shown here are calculated by 
averaging over 10 initial conditions. The sizes of the red circles are proportional 
to the corresponding matrix elements, while the sizes of the blue squares are 
proportional to the mean square deviations. 
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(a) Infinite memory (A(-^) = A(£) = 1). 




(b) Finite memory (A(-^) = X(^^ = 0.99). 

Figure 6: Asymptotic joint entropy for alter memory of agent A, with (a) infinite 
memories and (b) finite memories. Tlie plane represents C3 — 0. The colored 
lines in the C1-C2 plane are level lines for the joint entropy, while the black line 
shows where the C3 = plane intersects the C1-C2 plane. 
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Figure 7: Banded response disposition. The size of the circle is proportional to 
the value of the corresponding entry. 
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Figure 8: Rates of change for the aher memory of the student agent approaching 
the response disposition of the teacher agent. The colored Hues in the C1-C2 plane 
are level lines for the rates. 
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